Granular Classification for Imbalanced Datasets: A Minkowski Distance-Based Method
نویسندگان
چکیده
The problem of classification for imbalanced datasets is frequently encountered in practical applications. data to be classified this are skewed, i.e., the samples one class (the minority class) much less than those other classes majority class). When dealing with datasets, most classifiers encounter a common limitation, that is, they often obtain better performances on class. To alleviate study, fuzzy rule-based modeling approach using information granules proposed. Information granules, as some entities derived and abstracted from data, can used describe capture characteristics (distribution structure) both classes. Since geometric depend distance measures granulation process, main idea study construct each Minkowski then establish models by “If-Then” rules. experimental results involving synthetic publicly available reflect proposed distance-based method produce series shapes granular satisfying performance datasets.
منابع مشابه
Classification in Imbalanced Datasets
In this thesis we study the classification task in the presence of class imbalanced data. This task arises in many applications when we are interested in the under-represented (minority) classes. Examples of such applications are related to fraud detection, medical diagnosis and monitoring, text categorization, risk management, information retrieval and filtering. Although there exist many stan...
متن کاملHybrid classification approach for imbalanced datasets
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi CHAPTER
متن کاملMargin-Based Over-Sampling Method for Learning from Imbalanced Datasets
Learning from imbalanced datasets has drawn more and more attentions from both theoretical and practical aspects. Over-sampling is a popular and simple method for imbalanced learning. In this paper, we show that there is an inherently potential risk associated with the oversampling algorithms in terms of the large margin principle. Then we propose a new synthetic over sampling method, named Mar...
متن کاملOversampling Method for Imbalanced Classification
Classification problem for imbalanced datasets is pervasive in a lot of data mining domains. Imbalanced classification has been a hot topic in the academic community. From data level to algorithm level, a lot of solutions have been proposed to tackle the problems resulted from imbalanced datasets. SMOTE is the most popular data-level method and a lot of derivations based on it are developed to ...
متن کاملA Novel One Sided Feature Selection Method for Imbalanced Text Classification
The imbalance data can be seen in various areas such as text classification, credit card fraud detection, risk management, web page classification, image classification, medical diagnosis/monitoring, and biological data analysis. The classification algorithms have more tendencies to the large class and might even deal with the minority class data as the outlier data. The text data is one of t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Algorithms
سال: 2021
ISSN: ['1999-4893']
DOI: https://doi.org/10.3390/a14020054